Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Combining neural-based approaches and linguistic knowledge for text recognition in multimedia documents

Identifieur interne : 000148 ( Main/Exploration ); précédent : 000147; suivant : 000149

Combining neural-based approaches and linguistic knowledge for text recognition in multimedia documents

Auteurs : Khaoula Elagouni [France]

Source :

RBID : Hal:tel-00864923

Descripteurs français

English descriptors

Abstract

This thesis focuses on the recognition of textual clues in images and videos. In this context, OCR (optical character recognition) systems, able to recognize caption texts as well as natural scene texts captured anywhere in the environment have been designed. Novel approaches, robust to text variability (differentfonts, colors, sizes, etc.) and acquisition conditions (complex background, non uniform lighting, low resolution, etc.) have been proposed. In particular, two kinds of methods dedicated to text recognition are provided:- A segmentation-based approach that computes nonlinear separations between characters well adapted to the localmorphology of images;- Two segmentation-free approaches that integrate a multi-scale scanning scheme. The first one relies on a graph model, while the second one uses a particular connectionist recurrent model able to handle spatial constraints between characters.In addition to the originalities of each approach, two extra contributions of this work lie in the design of a character recognition method based on a neural classification model and the incorporation of some linguistic knowledge that enables to take into account the lexical context.The proposed OCR systems were tested and evaluated on two datasets: a caption texts video dataset and a natural scene texts dataset (namely the public database ICDAR 2003). Experiments have demonstrated the efficiency of our approaches and have permitted to compare their performances to those of state-of-the-art methods, highlighting their advantages and limits.

Url:


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Combining neural-based approaches and linguistic knowledge for text recognition in multimedia documents</title>
<title xml:lang="fr">Combinaison d'approches neuronales et de connaissances linguistiques pour la reconnaissance de texte dans les documents multimédias</title>
<author>
<name sortKey="Elagouni, Khaoula" sort="Elagouni, Khaoula" uniqKey="Elagouni K" first="Khaoula" last="Elagouni">Khaoula Elagouni</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-176633" status="VALID">
<orgName>CIDER</orgName>
<orgName type="acronym">IRISA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-419362" type="direct"></relation>
<relation active="#struct-105128" type="indirect"></relation>
<relation active="#struct-105160" type="indirect"></relation>
<relation active="#struct-172265" type="indirect"></relation>
<relation active="#struct-247362" type="indirect"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-300812" type="indirect"></relation>
<relation name="- RENNES" active="#struct-301232" type="indirect"></relation>
<relation active="#struct-301262" type="indirect"></relation>
<relation name="UMR6074" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-419362" type="direct">
<org type="department" xml:id="struct-419362" status="VALID">
<orgName>SYSTÈMES LARGE ÉCHELLE</orgName>
<orgName type="acronym">IRISA-D1</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">https://www.irisa.fr/fr/departements/d1-systemes-large-echelle</ref>
</desc>
<listRelation>
<relation active="#struct-105128" type="direct"></relation>
<relation active="#struct-105160" type="indirect"></relation>
<relation active="#struct-172265" type="indirect"></relation>
<relation active="#struct-247362" type="indirect"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-300812" type="indirect"></relation>
<relation name="- RENNES" active="#struct-301232" type="indirect"></relation>
<relation active="#struct-301262" type="indirect"></relation>
<relation name="UMR6074" active="#struct-441569" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-105128" type="indirect">
<org type="laboratory" xml:id="struct-105128" status="VALID">
<idno type="IdRef">026386909</idno>
<idno type="ISNI">0000 0001 2298 7270</idno>
<idno type="RNSR">200012163A</idno>
<orgName>Institut de Recherche en Informatique et Systèmes Aléatoires</orgName>
<orgName type="acronym">IRISA</orgName>
<date type="start">2000</date>
<desc>
<address>
<addrLine>Avenue du général LeclercCampus de Beaulieu 35042 RENNES CEDEX</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.irisa.fr</ref>
</desc>
<listRelation>
<relation active="#struct-105160" type="direct"></relation>
<relation active="#struct-172265" type="direct"></relation>
<relation active="#struct-247362" type="direct"></relation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-300812" type="direct"></relation>
<relation name="- RENNES" active="#struct-301232" type="direct"></relation>
<relation active="#struct-301262" type="direct"></relation>
<relation name="UMR6074" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-105160" type="indirect">
<org type="institution" xml:id="struct-105160" status="VALID">
<orgName>Université de Rennes 1</orgName>
<orgName type="acronym">UR1</orgName>
<desc>
<address>
<addrLine>2 rue du Thabor - CS 46510 - 35065 Rennes cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-rennes1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-172265" type="indirect">
<org type="institution" xml:id="struct-172265" status="VALID">
<orgName>Université de Bretagne Sud</orgName>
<orgName type="acronym">UBS</orgName>
<desc>
<address>
<addrLine>BP 92116 - 56321 Lorient cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-ubs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-247362" type="indirect">
<org type="institution" xml:id="struct-247362" status="VALID">
<orgName>École normale supérieure - Rennes</orgName>
<orgName type="acronym">ENS Rennes</orgName>
<desc>
<address>
<addrLine>Campus de Ker Lann - avenue Robert Schuman - 35170 Bruz</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ens-rennes.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de VoluceauRocquencourt - BP 10578153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300812" type="indirect">
<org type="institution" xml:id="struct-300812" status="VALID">
<orgName>SUPELEC</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="- RENNES" active="#struct-301232" type="indirect">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-301262" type="indirect">
<org type="institution" xml:id="struct-301262" status="VALID">
<orgName>Télécom Bretagne</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR6074" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Rennes</settlement>
<region type="region" nuts="2">Région Bretagne</region>
</placeName>
<orgName type="university">Université de Rennes 1</orgName>
<orgName type="institution" wicri:auto="newGroup">Université européenne de Bretagne</orgName>
<placeName>
<settlement type="city">Lorient</settlement>
<region type="region" nuts="2">Région Bretagne</region>
</placeName>
<orgName type="university">Université de Bretagne-Sud</orgName>
<orgName type="institution" wicri:auto="newGroup">Université européenne de Bretagne</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:tel-00864923</idno>
<idno type="halId">tel-00864923</idno>
<idno type="halUri">https://tel.archives-ouvertes.fr/tel-00864923</idno>
<idno type="url">https://tel.archives-ouvertes.fr/tel-00864923</idno>
<date when="2013-05-28">2013-05-28</date>
<idno type="wicri:Area/Hal/Corpus">000031</idno>
<idno type="wicri:Area/Hal/Curation">000031</idno>
<idno type="wicri:Area/Hal/Checkpoint">000048</idno>
<idno type="wicri:Area/Main/Merge">000150</idno>
<idno type="wicri:Area/Main/Curation">000148</idno>
<idno type="wicri:Area/Main/Exploration">000148</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Combining neural-based approaches and linguistic knowledge for text recognition in multimedia documents</title>
<title xml:lang="fr">Combinaison d'approches neuronales et de connaissances linguistiques pour la reconnaissance de texte dans les documents multimédias</title>
<author>
<name sortKey="Elagouni, Khaoula" sort="Elagouni, Khaoula" uniqKey="Elagouni K" first="Khaoula" last="Elagouni">Khaoula Elagouni</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-176633" status="VALID">
<orgName>CIDER</orgName>
<orgName type="acronym">IRISA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-419362" type="direct"></relation>
<relation active="#struct-105128" type="indirect"></relation>
<relation active="#struct-105160" type="indirect"></relation>
<relation active="#struct-172265" type="indirect"></relation>
<relation active="#struct-247362" type="indirect"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-300812" type="indirect"></relation>
<relation name="- RENNES" active="#struct-301232" type="indirect"></relation>
<relation active="#struct-301262" type="indirect"></relation>
<relation name="UMR6074" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-419362" type="direct">
<org type="department" xml:id="struct-419362" status="VALID">
<orgName>SYSTÈMES LARGE ÉCHELLE</orgName>
<orgName type="acronym">IRISA-D1</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">https://www.irisa.fr/fr/departements/d1-systemes-large-echelle</ref>
</desc>
<listRelation>
<relation active="#struct-105128" type="direct"></relation>
<relation active="#struct-105160" type="indirect"></relation>
<relation active="#struct-172265" type="indirect"></relation>
<relation active="#struct-247362" type="indirect"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-300812" type="indirect"></relation>
<relation name="- RENNES" active="#struct-301232" type="indirect"></relation>
<relation active="#struct-301262" type="indirect"></relation>
<relation name="UMR6074" active="#struct-441569" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-105128" type="indirect">
<org type="laboratory" xml:id="struct-105128" status="VALID">
<idno type="IdRef">026386909</idno>
<idno type="ISNI">0000 0001 2298 7270</idno>
<idno type="RNSR">200012163A</idno>
<orgName>Institut de Recherche en Informatique et Systèmes Aléatoires</orgName>
<orgName type="acronym">IRISA</orgName>
<date type="start">2000</date>
<desc>
<address>
<addrLine>Avenue du général LeclercCampus de Beaulieu 35042 RENNES CEDEX</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.irisa.fr</ref>
</desc>
<listRelation>
<relation active="#struct-105160" type="direct"></relation>
<relation active="#struct-172265" type="direct"></relation>
<relation active="#struct-247362" type="direct"></relation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-300812" type="direct"></relation>
<relation name="- RENNES" active="#struct-301232" type="direct"></relation>
<relation active="#struct-301262" type="direct"></relation>
<relation name="UMR6074" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-105160" type="indirect">
<org type="institution" xml:id="struct-105160" status="VALID">
<orgName>Université de Rennes 1</orgName>
<orgName type="acronym">UR1</orgName>
<desc>
<address>
<addrLine>2 rue du Thabor - CS 46510 - 35065 Rennes cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-rennes1.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-172265" type="indirect">
<org type="institution" xml:id="struct-172265" status="VALID">
<orgName>Université de Bretagne Sud</orgName>
<orgName type="acronym">UBS</orgName>
<desc>
<address>
<addrLine>BP 92116 - 56321 Lorient cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-ubs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-247362" type="indirect">
<org type="institution" xml:id="struct-247362" status="VALID">
<orgName>École normale supérieure - Rennes</orgName>
<orgName type="acronym">ENS Rennes</orgName>
<desc>
<address>
<addrLine>Campus de Ker Lann - avenue Robert Schuman - 35170 Bruz</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.ens-rennes.fr</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de VoluceauRocquencourt - BP 10578153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300812" type="indirect">
<org type="institution" xml:id="struct-300812" status="VALID">
<orgName>SUPELEC</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="- RENNES" active="#struct-301232" type="indirect">
<org type="institution" xml:id="struct-301232" status="VALID">
<orgName>Institut National des Sciences Appliquées</orgName>
<orgName type="acronym">INSA</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-301262" type="indirect">
<org type="institution" xml:id="struct-301262" status="VALID">
<orgName>Télécom Bretagne</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle name="UMR6074" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Rennes</settlement>
<region type="region" nuts="2">Région Bretagne</region>
</placeName>
<orgName type="university">Université de Rennes 1</orgName>
<orgName type="institution" wicri:auto="newGroup">Université européenne de Bretagne</orgName>
<placeName>
<settlement type="city">Lorient</settlement>
<region type="region" nuts="2">Région Bretagne</region>
</placeName>
<orgName type="university">Université de Bretagne-Sud</orgName>
<orgName type="institution" wicri:auto="newGroup">Université européenne de Bretagne</orgName>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="en">
<term>Classification connexionniste</term>
<term>Modèle de graphes</term>
<term>Modèle de langues</term>
<term>Réseau de neurones</term>
<term>Scanning multi-échelles</term>
</keywords>
<keywords scheme="mix" xml:lang="fr">
<term>Reconnaissance de caractères</term>
<term>Reconnaissance de texte</term>
<term>Segmentation</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This thesis focuses on the recognition of textual clues in images and videos. In this context, OCR (optical character recognition) systems, able to recognize caption texts as well as natural scene texts captured anywhere in the environment have been designed. Novel approaches, robust to text variability (differentfonts, colors, sizes, etc.) and acquisition conditions (complex background, non uniform lighting, low resolution, etc.) have been proposed. In particular, two kinds of methods dedicated to text recognition are provided:- A segmentation-based approach that computes nonlinear separations between characters well adapted to the localmorphology of images;- Two segmentation-free approaches that integrate a multi-scale scanning scheme. The first one relies on a graph model, while the second one uses a particular connectionist recurrent model able to handle spatial constraints between characters.In addition to the originalities of each approach, two extra contributions of this work lie in the design of a character recognition method based on a neural classification model and the incorporation of some linguistic knowledge that enables to take into account the lexical context.The proposed OCR systems were tested and evaluated on two datasets: a caption texts video dataset and a natural scene texts dataset (namely the public database ICDAR 2003). Experiments have demonstrated the efficiency of our approaches and have permitted to compare their performances to those of state-of-the-art methods, highlighting their advantages and limits.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
<region>
<li>Région Bretagne</li>
</region>
<settlement>
<li>Lorient</li>
<li>Rennes</li>
</settlement>
<orgName>
<li>Université de Bretagne-Sud</li>
<li>Université de Rennes 1</li>
<li>Université européenne de Bretagne</li>
</orgName>
</list>
<tree>
<country name="France">
<region name="Région Bretagne">
<name sortKey="Elagouni, Khaoula" sort="Elagouni, Khaoula" uniqKey="Elagouni K" first="Khaoula" last="Elagouni">Khaoula Elagouni</name>
</region>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000148 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000148 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Hal:tel-00864923
   |texte=   Combining neural-based approaches and linguistic knowledge for text recognition in multimedia documents
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024